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Technical Field Of The Invention 

[0001] The present invention is directed, in general, to sound recording and reproduction and, more specifically, 
to a resolution-independent system and method for making a reducing of a sound and later employing the record to 
synthesize the sound. 

Background Of The Invention 

[0002] Current recording of musical instrument performances are either based on sampling the analog signal of 
the instrument or recording the gestures that are input to a controller. This leads to playback situations where the 
performance may only be edited in a fixed time domain for the case of sampling, or a performance recording that may 
only realistically record keyboard and percussion in the case of Musical Instrument Digital Interface (MIDI). 
Additive and spectral synthesis technologies break down musical performances into discrete notes as opposed to a 
continuous performance. Generally, the ability to change the tempo or the key and to synchronize a performance with 
external events during playback are often difficult to accomplish at reasonable cost and without unacceptable 
distortions. 

[0003] The sound waveforms produced may be characterized by many parameters, including frequency and 
amplitude. Using Fourier analysis, sound waveforms may be represented in a frequency domain as a spectral frame, 
consisting of spectral components. The spectral frame contains the waveform's lowest, or fundamental, frequency, 
along with its harmonics (spectral components which occur at multiples of the fundamental frequency). Spectral 
components from string instruments and from vowels in speech typically occur at close to whole number multiples of 
the fundamental frequency, while spectral components from percussion instruments often occur at non-integral 
multiples of the fundamental frequency. 

[0004] Current sound recordings have been seen to be typically sample rate dependent or suffer from other 
recording and playback characteristics that make modifications to the record difficult to accomplish at acceptable 
costs or distortion levels. Additionally, these limitations make the offering of current sound recordings very 
limited in format for playback selection. Radio stations offer a selection of recordings that may be programmed for 
many days into the future with only an occasional specific request capability allowed, usually by telephone. Even in 
the case of selected requests, the recording is completely fixed in format with respect to. tempo and key as well as 
its basic arrangement. 
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particular sampling frequency. 

[0011] In one embodiment of the present invention, the system further includes a mapping circuit that applies a 
temporal quantization map to the record. Once the record is created, the present invention accommodates a wide range 
of conventional and later-developed sound manipulation techniques. . 
5 [0012] in one embodiment of the present invention, the system further includes an editor that modifies a 
selected one of a content and an order of the record. The editor allows still further manipulation of the sound once 
recorded. 

[0013] In one embodiment of the present invention, the frame analyzer identifies the selected one of common 
spectra structures and common formant structures by Fourier analyzing the frames. Those skilled in the art are 

10 familiar with, in particular, fast Fourier transform techniques by which frequencies may be analyzed. The present 
invention is compatible with other conventional or later-developed spectrum analysis techniques, such as wavelets. 
[0014] The present invention further provides infrastructures for distributing recordings for remote playback. 
One infrastructure includes: (1) a radio station having a recording database associated therewith, (2) a plurality 
of recordings contained within the recording database, each of the plurality of recordings including a selected one 

15 of common spectra structures and common formant structures corresponding thereto, (3) a request receiver, coupled to 
the recording database, that receives remote requests for ones of the plurality of recordings and (4) a transmitter, 
coupled to the recording database, that transmits the ones of the plurality of recordings in response to the requests. 
[0015] The present invention therefore provides what amounts to "audio-on-demand" wherein formatted audio files 
are provided to remote "radios" to allow the remote "radios" to synthesize the audio in situ. Therefore, in one 

20 embodiment of the present invention, the infrastructure further includes a plurality of remote radios capable of 
receiving and digitally manipulating the ones of the plurality of recordings. The remote radios may comprise 
software that can be downloaded and executed on data processing and storage hardware to allow the ones of the 
plurality of recordings to be played. This infrastructure sharply contrasts with conventional analog AM or FM radio 
infrastructures in which remote radios simply demodulate and amplify received radio waves. 

[0016] In one embodiment of the present invention, the transmitter broadcasts the ones of the plurality of 
recordings to receivers. Alternatively, the ones of the plurality of recordings may be addressed to individual 
remote "radios." 

[0017] In one embodiment of the present invention, the ones of the pluralities of recordings are embodied in a 
plurality of bitstream files. The bitstream files contain data pertaining to the fundamental frequencies and the 
30 selected one as described above. 

[0018] tn one embodiment of the present invention, the recording database contains a record of the requests. 

This allow? sono DODularitv or advertisement dissemination to be tracked and accurate royalty payments to be 
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[0021] Referring initially to FIGURE 1, illustrated is a block diagram 100 of a resolution-independent system 
for recording a musical instrument constructed' according to the principles of the present invention. The resolution- 
independent system of block diagram 100 includes a sound source 110, a sampler 120, a frame generator 130, a frame 
analyzer 140, a first storage unit 150, a mapping circuit 160, an editor 170 and a second storage unit 180. 
[0022] The present invention provides systems and methods for recording and synthesizing sound in a resolution- 
independent manner and infrastructures for distributing resolution-independent recordings for remote playback. In 
the present embodiment, a musical instrument may be used to generate the sound. Those skilled in the art are 
familiar with the formant content of certain musical instruments, such as string and wind instruments. Human voices 
also contain formants that may be captured and employed in later synthesis to be used in play-back of the recording. 
The present invention can operate with any sound, however. 

[0023] The sampling, which may occur at 1 ms intervals, is accomplished by the sampler 120 before the 
fundamental frequencies are extracted by the frame generator 130. However, those skilled in the art will understand 
that the present invention is not limited to a particular sampling frequency or the use of a separate sampler as 
shown. The sampler 120 may be included as part of the frame generator 130. This embodiment allows the data 
comprising a sound source to be independent of the sampling rate or tempo desired. The frame generator 130 creates 
frames and extracts fundamental frequencies and spectral envelopes from the sound source 110 through the sampler 
120. Then, the frame analyzer 130 identifies a selected one of common spectral structures and common formant 
structures in the frames and creates a record, containing this selected one and the appropriate fundamental 
frequencies, that is then stored in the first storage unit 1 50, 

[0024] The present invention therefore introduces the broad concept of storing fundamental frequencies and 
selected structures in sound and creating a record containing those fundamental frequencies and selected structures 
to provide a basis for subsequent synthesis to be used in playback. The present invention analyzes the sound as a 
continuous performance, irrespective of individual tones or notes. The frames may be discrete, allowing them to 
correspond to a discrete period of time. The frame analyzer 140 identifies the selected one of common spectral 
structures and common formant structures by Fourier analyzing the frames. Those skilled in the art are familiar with 
Fast Fourier Transform (FFT) techniques as one technique by which frequencies may be analyzed. The present 
invention is compatible with other conventional or later-developed spectrum analysis techniques, such as wavelets, 
as well. 

[0025] In this embodiment, common spectral or formant structures may be contained in a dictionary in order 
to compress the total size of the record. The identification and grouping of common spectral or formant structures 
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in a step 215 before extracting the fundamental frequencies from the sampled sound signal. Frames, that may be 
discrete, are then generated in a step 220 from which fundamental frequencies and spectral envelopes are then 
extracted. These frames are then analyzed in a step 225 using Fourier analysis, and then common spectral and formant 
structures are identified in a step 230. These common structures are then stored in a step 235 which creates a 
5 record of the common structures. A temporal quantization map may then be applied to the record as shown in a step 
240, or the record may be modified selecting one of a content and an order of the record in a step 245 in order to 
edit the contents as required. The method ends in an end step 250 where the sound has been selected, sampled, 
framed, analyzed, recorded, mapped or modified in this embodiment 

[0030] Turning now to FIGURE 3, illustrated is a block diagram 300 of a resolution-independent system for 
w synthesizing a recorded musical instrument constructed according to the principles of the present invention. The 
resolution-independent system of block diagram 300 includes a storage unit 305, a mapping circuit 310, an editor 315, 
a waveshaper 320, an output device 325 and a speaker 330. The storage unit 305 contains the records and dictionaries 
that have been created by sampling, framing and analyzing the sound source as discussed in FIGURE 1 and FIGURE 2 
above. 

[0031] The mapping circuit 310 applies a temporal quantization map to the record. As stated earlier, the present 
invention may accommodate a wide range of conventional and later-developed sound manipulation techniques allowing 
the record to be synthesized with a changed tempo or key and provide the ability to synchronize the playback 
performance with an external clock. Further manipulation of the recorded sound is provided with the editor 315 which 
may modify the content or the order of the record to provide effects not contained in the original sound source. The 
editor 315 may also be used to re-arrange the record sequence in time relative to the frequency contour as discussed. 
[0032] The waveshaper 320 is coupled to the storage unit 305, the mapping circuit 310 and the editor 315. The 
waveshaper 320 takes the fundamental frequencies and applies a waveshaping transfer function to create a waveform 
from either the stored record, the mapped record or the edited record. The waveshaper 320 may also use some 
combination of these three in order to generate the waveform. The waveshaper 320 may also select from a number of 
waveshaping transfer functions that are stored in the waveshaper 320 to accommodate the waveshaping process. The 
waveshaper 320 is clocked externally, in this embodiment, allowing the waveform to be synchronized with external 
events. The waveform may then be converted into an output sound using the output device 325 and the speaker 330. 
The synthesis process represented here allows the originally recorded sound to be reproduced with appropriate 
fidelity or allows the originally recorded sound to be modified as deemed appropriate to the user. This does not 
preclude the use of other synthesis techniques such as FFT or direct sine. 

[0033] Turning now to FIGURE 4, illustrated is a flow diagram 400 of a resolution-independent method of 
synthesizing a recorded musical instrument that may be carried out in the system of FIGURE 3. The method depicted in 
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synthesize the audio in situ. In this embodiment of the present invention, the infrastructure further includes a 
plurality of remote digital radios capable of receiving and digitally manipulating the plurality of recordings. This 
infrastructure sharply contrasts with conventional analog AM or FM radio infrastructures in which remote radios 
simply demodulate and amplify received radio waves. This infrastructure may function with any currently-proposed or 
later-developed digital transmitter and receiver standards. The prograr material for the recordings may include but 
is not limited to weather, news, stock quotes or other topical information. 

[0039] The random access playlist 515 represents the pluralities of recordings, which are embodied in a 
plurality of bitstream files. The bitstream files contain data pertaining to the fundamental frequencies and the 
selected one as described above. The bitstream files, which may represent a collection of selections or the 
collection of offerings, may occur in a single serial loop. The user may select the ones of these that are 
downloaded and played. Alternately, they may occur in a collection of parallel loops allowing the user to perform 
the download 520 more rapidly. The transmitter, associated with the wireless server 505, may broadcast the ones of 
the plurality of recordings to all remote radios. Alternatively, the ones of the plurality of recordings may be 
addressed only to individual remote radios. The recording database may contain a record of the requests. This allows 
song popularity or advertisement dissemination to be tracked and accurate royalty payments to be calculated 
automatically. 

Claims 

1. A method of recording sound, for example sound produced by a musical instrument, comprising: 
extracting fundamental frequencies and spectral envelopes from said sound; 
creating frames from said fundamental frequencies and spectral envelopes; 

identifying a selected one of common spectra structures and common formant structures in said frames; and 
creating a record containing said fundamental frequencies and said selected one. 



2. The method as recited in claim 1 wherein said frames are discrete. 
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a transmitter, coupled to said recording database, that transmits said ones of said plurality of recordings 
in response to said requests. 

9. The infrastructure as recited in claim 8 wherein said transmitter broadcasts said ones of said plurality of 
recordings to receivers. 

10. The infrastructure as recited in claim 15 further comprising a plurality of remote radios capable of receiving 
and digitally manipulating said ones of said plurality of recordings. 

11. The infrastructure as recited in any of claims 8 to 10 wherein said ones of said pluralities of recordings are 
embodied in a plurality of bitstream files. 

12. The infrastructure as recited in any of claims 8 to 11 wherein said recording database contains a record of 
said requests. 

13. A radio, comprising: 

a receiver for receiving a recording including fundamental frequencies and a selected one of common spectra 
structures and common formant structures corresponding thereto; 

a waveshaper, coupled to said receiver, for applying a waveshaping transfer function based on said selected 
one to said fundamental frequencies to create a waveform; and 

a speaker, coupled to said waveshaper, for converting said waveform into an output sound. 

14. The radio as recited in claim 13 further comprising a mapping circuit, coupled to said receiver, that applies a 
temporal quantization map to said selected one. 

15. The radio as recited in claim 13 or claim 14 wherein said waveshaping transfer function is selected from a 

nhjrplitv of wavpshgnino transfer functions stored in said waveshaDer.. 
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FIG. 2 
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FIG. 3 
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FIG . 4 
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FIG. 5 A 
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